

The dataset contains 6 text files and 1506 screenshots of webpages.

All text files contain tab-separated data tables. One file contains demographic information of participants, with different participants differentiated using IDs (column 'uid'). These IDs correspond to the column names in the other 5 text files, which contain participants' evaluations of webpage visual complexity, aesthetics, technical condition, novelty and design craftsmanship. Each participant evaluated only 1 of these 5 webpage qualities (hence, different user IDs across the 5 files) and only for 100 out of 1506 webpages - the missing scores are put as NA (stands for 'Not Available' in R).

The 1506 screenshots are a combination of 5 previously published and 1 new dataset. The past datasets are (screenshots names begin accordingly):

IJHCS_12: Tuch, A. N., Presslaber, E. E., StöCklin, M., Opwis, K., & Bargas-Avila, J. A. (2012). The role of visual complexity and prototypicality regarding first impression of websites: Working towards understanding aesthetic judgments. International journal of human-computer studies, 70(11), 794-811.

CHI_13: Reinecke, K., Yeh, T., Miratrix, L., Mardiko, R., Zhao, Y., Liu, J., & Gajos, K. Z. (2013, April). Predicting users' first impressions of website aesthetics with a quantification of perceived visual complexity and colorfulness. In Proceedings of the SIGCHI conference on human factors in computing systems (pp. 2049-2058). ACM.

AVI_14: Miniukovich, A., & De Angeli, A. (2014, May). Quantification of interface visual complexity. In Proceedings of the 2014 international working conference on advanced visual interfaces (pp. 153-160). ACM.

CHI_15: Miniukovich, A., & De Angeli, A. (2015, April). Computation of interface aesthetics. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (pp. 1163-1172). ACM.

ICWE_19: Boychuk, E., & Bakaev, M. (2019, June). Entropy and Compression Based Analysis of Web User Interfaces. In International Conference on Web Engineering (pp. 253-261). Springer, Cham.

CHI_20 is a prefix for the new, previously not used/published webpages

All webpages are identified using artificial, numerical IDs, and are not explicitly linked to the websites they were collected from.